Automatic Utterance Segmentation in Spontaneous Speech

نویسندگان

  • Norimasa Yoshida
  • Peter Gorniak
چکیده

As applications incorporating speech recognition technology become widely used, it is desireable to have such systems interact naturally with its users. For such natural interaction to occur, recognition systems must be able to accurately detect when a speaker has finished speaking. This research presents an analysis combining lower and higher level cues to perform the utterance endpointing task. The analysis involves obtaining the optimal parameters for the signal level utterance segmenter, a component of the speech recognition system in the Cognitive Machines Group, and exploring the incorporation of pause duration and grammar information to the utterance segmentation task. As a result, we obtain an optimal set of parameters for the lower level utterance segmenter, and show that part-of-speech based N-gram language modeling of the spoken words in conjunction with pause duration can provide effective signals for utterance endpointing. Thesis Supervisor: Deb Roy Title: Assistant Professor of Media Arts and Sciences

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Input Segmentation of Spontaneous Speech in JANUS: A Speech-to-speech Translation System

JANUS is a multi-lingual speech-to-speech translation system designed to facilitate communication between two parties engaged in a spontaneous conversation in a limited domain. In this paper we describe how multi-level segmentation of single utterance turns improves translation quality and facilitates accurate translation in our system. We define the basic dialogue units that are handled by our...

متن کامل

Input Segmentation of Spontaneous Speech inJANUS : a Speech - to - speech Translation

JANUS is a multilingual speech-to-speech translation system designed to facilitate communication between two parties engaged in a spontaneous conversation in a limited domain. In this paper we describe how multi-level segmentation of single utterance turns improves translation quality and facilitates accurate translation in our system. We deene the basic dialogue units that are handled by our s...

متن کامل

اثر طول گفته بر روانی گفتار خودانگیخته کودکان و بزرگسالان لکنتی فارسی زبان

Objective: recently, researchers have increasingly turned to study the relation between stuttering and utterance length. This study investigates the effect of utterance length on the amount of speech dysfluency in stuttering Persian-speaking children and adults in conversational speech. The obtained results can pave the way to reach a better understanding of stuttering of child and adults, as w...

متن کامل

Utterance segmentation and turn-taking in spoken dialogue systems

A widely used method for finding places to take turn in spoken dialogue systems is to assume that an utterance ends where the user ceases to speak. Such endpoint detection normally triggers on a certain amount of silence, or non-speech. However, spontaneous speech frequently contains silent pauses inside sentence-like units, for example when the speaker hesitates. This paper presents /nailon/, ...

متن کامل

Separation of non-spontaneous and spontaneous speech

There are many situations in which it is desirable to be able to distinguish spontaneous speech and speech which is non-spontaneous. Examples of situations in which this problem may arise include forensic evidence situations, sorting voice-mail responses from voice-mail menus, and automatic segmentation of spontaneous responses from prepared questions. The later situation can occur if it is des...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014